log 2
Country:
- Asia > Middle East > Israel > Tel Aviv District > Tel Aviv (0.04)
- Antarctica (0.04)
Industry:
- Education > Educational Setting > Online (0.47)
- Information Technology > Security & Privacy (0.46)
- Transportation > Air (0.40)
Country:
- Europe > United Kingdom > England > Cambridgeshire > Cambridge (0.04)
- Asia > Japan > Honshū > Tōhoku > Fukushima Prefecture > Fukushima (0.04)
- Asia > China (0.04)
Technology:
- Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (1.00)
- Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.93)
- Information Technology > Data Science (0.92)
- Information Technology > Artificial Intelligence > Representation & Reasoning (0.67)
Country:
- Asia > China > Shanghai > Shanghai (0.05)
- Europe > United Kingdom > England > Cambridgeshire > Cambridge (0.04)
- Asia > Japan > Honshū > Kantō > Kanagawa Prefecture (0.04)
Technology:
Country:
- Europe > France > Hauts-de-France > Nord > Lille (0.04)
- Europe > United Kingdom > England > Cambridgeshire > Cambridge (0.04)
- Europe > France > Auvergne-Rhône-Alpes > Lyon > Lyon (0.04)
- (2 more...)
Genre:
- Workflow (0.93)
- Research Report > New Finding (0.46)
Industry:
- Information Technology > Security & Privacy (1.00)
- Health & Medicine (1.00)
Technology:
Country:
- Asia > Middle East > Israel > Tel Aviv District > Tel Aviv (0.04)
- Asia > China (0.04)
Technology: Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning > Gradient Descent (0.68)
Country:
- Asia > Afghanistan > Parwan Province > Charikar (0.05)
- Europe > United Kingdom > Scotland > City of Edinburgh > Edinburgh (0.04)
- Asia > Middle East > Jordan (0.04)
Technology:
Country:
- North America > United States > California (0.04)
- Europe > Germany > Baden-Württemberg > Tübingen Region > Tübingen (0.04)
Technology:
Country:
- North America > United States > Pennsylvania > Allegheny County > Pittsburgh (0.14)
- North America > United States > California > Santa Clara County > Palo Alto (0.04)
- Europe > United Kingdom > England > Cambridgeshire > Cambridge (0.04)
- Africa > Eswatini > Manzini > Manzini (0.04)
Technology:
Provably Safe Reinforcement Learning with Step-wise Violation Constraints
We name this problem Safe-RL-SW . Our step-wise violation constraint differs from prior expected violation constraint (Wachi & Sui, 2020; Efroni et al., 2020b; Kalagarla et al., 2021) in two aspects: (i) Minimizing the step-wise violation enables the agent to learn an optimal policy that avoids unsafe regions deterministically,
Country:
- North America > United States > Illinois (0.04)
- Asia > China (0.04)